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Abstract. We discuss Quantum Chromodynamics calculations using the lattice regulator. 
The theory of the strong force is a cornerstone of the Standard Model of particle physics. 
We present USQCD collaboration results obtained on Argonne National Lab's Intrepid 
supercomputer that deepen our understanding of these fundamental theories of Nature and 
provide critical support to frontier particle physics experiments and phenomenology. 



1. Introduction 

Thanks to recent advances in computer power, software, and algorithms, lattice gauge theory 
calculations of Quantum Chromodynamics (QCD), or Lattice QCD for short, have come a long 
way since Wilson proposed the method[l^ 35 years ago. In this talk, we discuss results from 
lattice QCD and members of the USQCD collaboration that impact frontier particle physics 
research. 

QCD is the fundamental theory of the strong force responsible for the material world. Quarks 
and gluons, interacting via QCD bind together to form hadrons like the ubiquitous protons and 
neutrons and less well known, less massive vr and K mesons, in some ways like electrons and nuclei 
form atoms by exchanging the electromagnetic force carriers, the photons. QCD is different from 
Quantum Electrodynamics (QED) in some important ways. For one, the coupling constant, or 
strength of the QCD interaction, is large, unlike the electromagnetic charge e. Further, QCD 
is said to confine: quarks and gluons which carry the "color" charge of QCD are not free; they 
only exist in bound states that are neutral with respect to color charge. These eigenstates of 
the theory are the hadrons just mentioned. Finally, there are eight force carriers in QCD, called 
gluons, compared to one photon in QED, and they interact with each other as well as the quarks. 
For these reasons, QCD at the relatively low energy scale of a typical hadron mass, say 1 GeV, 
can not be studied in a perturbative framework. Being a relativistic quantum field theory, QCD 
must be regularized, or rigorously defined. This can be done using a discrete hypercubic lattice 
of points to describe space-time which we come back to in the next section. 

Discretizing the continuum theory leads naturally to numerical simulations. The non-abelian 
nature of QCD and the fact that an almost exact chiral symmetry of the quarks exists in Nature 
make it difficult to solve, even with the largest supercomputers. In this talk we describe the 
state-of-the-art simulations being done on some of the largest supercomputers in the world, the 
algorithms used for them, and the theoretical physics results gained from them. 



2. Dynamical fermion simulations 

The heart of any lattice gauge theory (see, e.g., [2]) calculation is its ensemble of gauge 
configurations, representing the gauge fields to be averaged over in the path integral for the 
quantum expectation value of some observable. For lattice QCD these configurations of "link" 
variables represent the gluons, for QED the photons. Feynman's path integral is to be taken over 
all possible configurations, or paths of the system, in configuration space. Since a configuration 
is the specification of the fields at every point in space-time, and since the (components of) fields 
take all possible values between ±00, the un-regulated path integral diverges. It is made finite by 
discretizing the continuum Lagrangian to produce one that exists on a four dimensional hyper- 
cubic lattice with spacing a. The continuum limit, a ^ 0, is taken practically by computing 
at several values of a and then extrapolating to 0. In practice the 4- volume is finite too, 
VT = (Nga)^ X Nta {Ns^t are the number of lattice sites in the space and time directions), so 
the infinite volume limit is taken in a similar way. 

There are several (quark) degrees of freedom at each lattice site and several (gluon) degrees 
of freedom on each link connecting neighboring sites in a typical simulation, and A^^ ~ 16 — 64, 
Nt ^ 32 — 200, so enumerating all possible configurations is impossible. We are therefore lead 
to Monte-Carlo simulations using importance sampling to estimate the path integraljl]. The 
configurations are chosen randomly, but with a weight equal to the exponential of their action. 
A typical configuration is generated from a Markov chain of configurations by performing an 
"update", or change, on the previous configuration. As long as the update algorithm satisfies 
basic constraints like detailed balance and ergodicity, in the limit of many updates, one is 
guaranteed to generate configurations with the correct weight, even if the starting configuration 
is not part of the equilibrium distribution [3]. Of course, there are many update algorithms 
that satisfy the basic requirements, so the trick is to find one that is optimal, that is, one 
that samples the configuration space as efficiently, but as cheaply, as possible. The USQCD 
collaboration has been using a combination of the hybrid monte-carlo|H [5] (HMC) and rational 
HMCjB] algorithms. The former is used for an even number of degenerate fermions, while the 
latter must be used for an odd number. The RHMC can also be used for even numbers of 
fermions. Here hybrid refers to the two steps in the update algorithm. First the fields are 
evolved through phase space by adding a fictitious conjugate momentum for the gauge field and 
numerically integrating the coupled Hamilton's equations of motion for the system in "monte- 
carlo time" . The trajectory length is defined as the number of integration steps times At, the 
integration time-step. Typical trajectory lengths have 0(1) time-units. This first step is often 
referred to as the molecular dynamics update. Since the integration is inexact (an error of order 
At^ accrues in the hamiltonian after each trajectory), a second metropolis accept /reject step is 
performed after each trajectory which makes the algorithm exact, i.e., the distribution of gauge 
configurations is the correct one in the limit that the number of updates is large. 

State-of-the-art calculations employ 2-1-1 flavors of dynamical light-mass quarks, meaning two 
degenerate up (u) and down (d) quarks, and one heavier strange quark (s). In Nature, the u and 
d quarks are nearly massless ((mu-|-m^)/2 ~ 3 — 5 MeV), while the s quark is somewhat heavier 
{nig ~ 90 — 100 MeV). Dynamical means the virtual pair creation/annihilation of the quarks, 
or vacuum polarization, effects on the gauge fields are included in the gauge configuration 
generation. In correlation functions, or expectation values, valence quarks appear, including 
heavier flavors of quarks. In Nature there are three, charm (c), bottom (b), and top (t). These 
quarks are so heavy (~ 1 — 175 GeV) that their vacuum polarization effects are ignored compared 
to the dominant statistical and systematic uncertainties in present lattice calculations. However, 
lattice results are nearing the one-percent precision level for some quantities, so groups are 
starting to consider including the lightest of the heavies, the c quark, in future configuration 
generation. Of the heavies, only charm can be simulated directly on present day lattices (a > 1/3 
GeV~^) because the dimensionless mass should satisfy ma <C 1 to avoid lattice spacing artifacts. 



There are ways to treat the heavy quarks so that even b quark observables may be computed 
accurately. Some of these are described later. The t quark is so massive that its decays can be 
treated accurately in perturbation theory. 



Table 1. Asqtad gauge configurations 
generated in 2008 by the MILC collab- 
oration on the ANL BG/P (Intrepid), 
"spc" is the lattice spacing in fm, and 
"traj" is the number of trajectories for 
the entire Markov chain. 
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Table 2. Same as Table [T] but 
for (5d) DWF, generated by LHPC, 
RBC, and UKQCD collaborations. 
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The quarks (fermions) always appear as the inverse of a large, sparse matrix known as 
the propagator because it describes how the quarks move through space-time. The matrix 
(D) is the finite lattice representation of the so-called Dirac operator which appears in the 
quantum mechanical equation of motion for a relativistic spin 1/2 particle. In the continuum 
the Dirac operator is proportional to a first order 4d covariant partial derivative. They only 
enter through D because the quantum fields themselves are anti-comuting, complex numbers 
known as Grassman variables which we do not know how to simulate on a computer. Since 
they appear quadratically in the action, sandwiching the Dirac operator, their contribution to 
the path integral can be computed analytically, leaving only factors of the propagator and a 
square root of the determinant of D for each flavor of quark in the action (see [7] , for example) . 
Evaluating the factors involving D dominates any calculation with fermions and boils down 
to inverting D, even for the determinant factors. This is because the det D factors can be 
re-exponentiated to the action by introducing fictitious bosons called pseudofermions. Their 
presence is felt through a force term in the equations of motion during the molecular dynamics 
update[4j. The cost due to the gauge part of the action is almost negligible by comparison, and 
most correlation functions of interest involve fermions. 

There is much freedom in defining D; it is only required to equal the continuum operator in 
the limit a — > and should also be gauge invariant to avoid unwanted mixing with unphysical 
operators. Thus, several discretizations are employed which are equivalent up to small errors 
proportional to positve powers of a (usually two). Within USQCD three types of fermion 
operators are being used in gauge field generation: a^-tadpole improved (Asqtad), domain wall 
and improved Wilson, or "clover" fermions. All three formally differ from the continuum 
theory by order errors, and they each have strengths and weaknesses. We discuss the first 
two, the latter is discussed in the talk by Edwards. 

2.1. Asqtad fermions 

Asqtad fermions are improved staggered fermions [HI l9l [TO] which ameliorate the flavor symmetry 
breaking O(a^) errors of un-improved staggered fermions. Staggered fermions reduce the number 
of fermion degrees of freedom due to lattice fermion doubling [TT] by diagonalizing the naive 
discretization of the Dirac operator and then eliminating 3 out of 4 of the spin components on 
each lattice site. In the limit a — > 0, the remaining spin degrees of freedom correspond to four 
degenerate Dirac fermions. The weight for a single fermion is then given by (det L») 1/4. If a / 0, 



the flavor symmetry (called taste-symmetry to distinguish it from a continuum flavor symmetry) 
is broken. It is this symmetry that is partially restored by the Asqtad formulation. There has 
been vigorous debate in the literature over the validity of the "rooting" procedure (whether it 
yields a continuum Dirac fermion, or a sick cousin when a 0)|12| [T3l [T¥l 115]). The primary 
advantages of staggered fermions are that they are relatively cheap to simulate and they have 
an exact axial symmetry for a ^ left over from the full SU(3)l x SU{3)r chiral symmetry of 
continuum QCD. 



3000 



2000 



/S = 7.46 mi=0.0018 64''xl44 



1000 



ANL Bd/P 
BigBen (XT3) 




100 200 300 

day of year 



DWF Production time history 

32"'x64x 16 ra =0.03 



= 0.004 QCDOC 

= 0.004 BG/P (2K,4K) 
ra, = 0.006 QCDOC 4K (UKQCD) 
ra, = 0.006 ANL BG/P (2K,4K) 
ra, = 0.008 ANL BG/P 2K 



48" X 64 X 16, ra. 



= 4000 - 

a 

s 




600 500 
(lays 



Figure 1. Time history of Asqtad 
fermion gauge configuration generation. 
The ensemble is described in the first 
row of Table [T] and corresponds to the 
lightest, most difficult quark mass. 



Figure 2. Time history 
gauge configuration generation, 
described in Table [2j 
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2.2. Domain wall fermions 

Domain wall fermions (DWF) were first formulated in [TB] as a non-perturbative method for 
defining and simulating chiral gauge theories, but were quickly reformulated for numerical 
efficiency In jT7] it was also recognized that DWF could be used to simulate vector gauge 
theories like QCD with exact chiral symmetry even when a 7^ 0. This is the principle advantage 
of DWF: chiral symmetry protects the theory from unwanted lattice artifacts and renders the 
lattice theory continuum-like even when a 7^ 0. The symmetry only becomes exact when the 
size of an extra 5th dimension, which provides the chiral symmetry to low-energy, 4 dimensional 
fermions stuck to the boundaries of the extra dimension, is taken to 00. For finite size the chiral 
symmetry is softly broken, inducing a small shift in the quark mass in addition to the bare 
mass appearing in the lagrangian. This shift, however, can be computed accurately from the 
simulation and its effects eliminated, or subsumed into the quark mass. The addition of an extra 
dimension, of course, increases the computational burden. The expectation is that the extra 
burden is partially offset by the continuum-like character of DWF since a may be relatively 
larger when continuum symmetries are intact. 

2.3. Early Science Program at ALCF 

In 2008, the USQCD collaboration was awarded Early Science Program (ESP) time on the 
Blue Gene/P Intrepid supercomputer at the Argonne (National Lab) Leadership Computing 



Facility (ALCF) under our three year INCITE allocation. The ESP time was primarily used for 
gauge field ensemble generation. The stability of Intrepid combined with USQCD's readiness to 
compute efficiently at the start of operation lead to spectacular gains in ensemble generation, 
advancing our science goals approximately two years ahead of schedule. The configurations are 
summarized in Tables [T] and [2] and the generation dramatized in Figs. [T] and |2j 

In the chiral limit of the light quarks, rrii ^ 0, D becomes singular and can not be inverted. 
Even though the physical u and d quarks are most likely massive, their masses are sufficiently 
small that present simulations are run with heavier masses, and expectation values are obtained 
by extrapolating to the physical point. Even though calculations could be done for lighter masses, 
even physical ones, the volumes would be too small to accommodate such light particles. A rule 
of thumb for QCD is that the mass of the pion (m,r) times the linear size of the box should 
be greater than 4 to avoid significant finite volume effects (see Tables [T] and 2]) . Recent work 
suggests this may be an underestimate of the needed size for some quantities [18] . 

Besides several values of the light quark mass for each type, the various ensembles of 
configurations summarized in Tables [T] and |2]refiect the exigencies of the two discretizations. The 
Asqtad configurations cover three lattice spacings (known as fine, super fine, and ultra fine) to 
bring the taste-breaking artifacts under control and together with existing coarser lattice spacing 
ensembles to take the continuum limit. The values of a and rrii are such that very large lattice 
sizes are required to simulate sufficiently large physical volume. For the DWF configurations, 
the lattice spacing is not as small, but is significantly smaller than a companion ensemble with 
a = 0.114 fm which together with these is being used to take the continuum limit[T9] (recall the 
leading artifacts for both go like a^). The size of the extra dimension is 16 lattice sites for each. 
The last row in Table [2] refers to a very demanding large volume, small mass simulation that 
was begun on Intrepid. However, the added size of the lattice and decreased mass have pushed 
the update algorithm beyond present capabilities, despite significant gains from tuning which 
we turn to next. Completion must await the next generation of leadership class computers. 



2.4- Tuning the rational hybrid monte carlo for domain wall fermions 

Improvements in algorithms have played a large role in lattice QCD simulations in the last 
few years. Most of the time used in generating lattices is related to including the full effects 
of light quarks in the Feynman path integral. The quarks enter through the determinant of 
the Dirac operator, which is a function of all the gluon fields on the lattice, and which also 
prohibits two quarks from occupying the same state, as required by the Pauli exclusion principle. 
The determinant is a positive definite quantity, so it can be added to an importance sampling 
measure. The standard way to do this, by writing the determinant as a bosonic path integral, 
requires all eigenvalues be positive. This is not the case here. However, the eigenvalues come in 
related pairs, which taken together are positive. 

To handle a single quark correctly, the determinant of the product D D'^ is used, which is 
manifestly positive eigenvalue-by-eigenvalue. On can show that det(DD^) = det(L') det(-D), so 
that the simulation of a single quark requires that (det(D D^^)^/^ be known. Using a rational 
approximation for (det(-D 0^)^/"^ yields an accurate way to include single quarks into simulations, 
which is vital for the 2-1-1 fiavor simulations that most accurately represent the physical world, 
where the up and down quarks are taken as light and degenerate in mass, and the strange quark 
is much heavier [6j. 

The RHMC algorithm, as currently implemented, uses a rational approximation for the effects 
of a single quark species, along with many other improvements, to allow the molecular dynamics 
steps to be as large as possible. We would like to detail a few of the recent improvements used 
in the RHMC algorithm. 

In every step of the molecular dynamics part of the RHMC algorithm, the propagation of 
quarks in the gluon background must be determined. This is the most numerically expensive 



part of the calculation and is generally done by a Krylov space solver. As quark masses get 
smaller (the physically interesting region) the solutions are ever more expensive. One feature 
added to current RHMC implementations is to minimize the force from light quarks via the 
insertion of Hasenbush preconditioning masses |20j. The light quark forces are still expensive 
to evaluate, but they are not evaluated as often, since they are small. This allows one to use 
an integrator with multiple time scales: expensive to calculate forces are made small and only 
evaluated on a relatively large time scale, while large, cheaper forces are evaluated often. Use 
of multiple time scale integrators has allowed us to target our computing time most effectively. 

The improvements mentioned above have been in place for the last 2-3 years. For the quark 
masses and voluems we were simulating with 2-3 years ago, speed-ups of a factor of 5 or more 
were seen. This was a sufficiently large gain that we then moved to smaller quark masses which 
were not possible without the improved RHMC algorithm. In the last year, with the availability 
of the large BG/L and BG/P installations, we have been doing simulations at much smaller 
quark masses, and on much larger volumes. In our our most demanding simulation to date (last 
row. Table |2]), we observed that it was taking over 3 times as long as expected. This DWF 
simulation has a pion mass of roughly 240 MeV which is small by current standards, but still 
well above the physical value of 140 MeV. 

It is important to know whether we had reached a fundamental limit in the capability of 
the RHMC, related to physics at such a light quark mass, or whether the particular choice 
of RHMC parameters was not optimal. Fortunately, we have found the latter to be the case. 
For this light quark mass, we introduced an additional Hasenbush preconditioning mass and an 
additional time step in our multi-time step integrator. These steps should have helped to keep 
the forces from the light quarks small and allowed us to use a large time step for them. Initially 
these improvements did not help with the acceptance. However, we then decreased the stopping 
condition for our Krylov solvers by a factor of 100, to 3 x 10~^. With this increased accuracy, we 
were able to increase the molecular dynamics step size and regain the factor of three we had lost 
for this simulation. Our system has O(IO^) degrees of freedom and, for a good acceptance, the 
change in the energy at the end of a molecular dynamics trajectory must be an 0(1) quantity. 
Thus it is not surprising that such a small residual is required in our Krylov solvers. 

For our current 2-|-l flavor simulations, we have found it optimal to rewrite the path integral 

as 

Z - J [dA] ^ '^^*^("^') ^ f detP(m^) 



detP(mx)/ \det'D{ms 



detP(l) 



exp{-Sg{A)} (1) 



where T){m) = D\mi)D{mi) and mi = m„ = m^. Each ratio of determinants involves a separate 
pseudofermion field and the fermions are integrated on three different time scales. A represents 
the gluon field, and Sg the action of its kinetic term. 



3. Results 

3.1. Improved staggered fermions 

The MILC Asqtad configurations are being used by several groups within USQCD to compute 
fundamental properties of QCD and the SM. The gluon topological charge susceptibility from 
MILC is shown in Fig.[3| When plotted with the non-Goldstone pion mass rriT^i, it tends to vanish 
in the combined chiral and continuum limits, as it should. The pion leptonic decay constant is 
shown in Fig. |4j accompanied by a multi-parameter fit of the data to chiral perturbation theory. 
The data are fit simultaneously with corresponding pion mass measurements and extrapolated 
in quark mass to the point where the pion mass agrees with its value from experiment. This 
fixes the value of the light quark mass at the "physical point" . Not only does this allow all other 
calculated quantities to be evaluated at the physical point, but the quark mass determined this 
way, once renormalized, is a fundamental parameter of the SM. 
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Figure 3. Gluon topological charge suscep- 
tibility (MILC Collaboration), ro is the Som- 
mer parameter. 
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Figure 4. Pion decay constant (MILC 
Collaboration). 



A huge experimental effort is currently devoted to the study of the decays and mixings of 
mesons with one heavy and one light quark with the ambitious aim of observing phenomena 
beyond the SM. The experimental measurements must be complemented, however, by precise 
lattice QCD calculations of the corresponding weak matrix elements. 

The HPQCD calculation of the Dg meson decay constant is shown in Fig. [5] The new 
calculations on the super and ultra fine lattices are quite consistent with their earlier prediction 
which is 2.3 standard deviations lower than the 2009 measurement from the CLEO experiment. 
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The Fermilab/MILC group has been calculating heavy/light matrix elements on the Asqtad 
lattices, using Asqtad light valence quarks and clover improved Fermilab heavy valence quarks. 
They recently calculated the B meson decays B D*lui and B ttIui [22j to determine the 
Cabibbo-Kobayashi-Maskawa (CKM) matrix elements V^b and V^b^ important SM parameters in 
the search for new physics. These matrix elements will be calculated on the ultra fine ensemble, 
reducing the uncertainties due to the heavy quark discretization, the largest errors in this set of 
calculations, by a factor of two. 

With the advent of Leadership Class computers which can handle small a, large size lattices, 
HPQCD is investigating simulating a relativistic b quark (m^ ~ 4 GeV) directly on the lattice. 
Figure |6] shows the result for the heavy-strange meson decay constant computed with highly 
improved staggered quarks (HISQ) for both the heavy and s valence quarks. As o decreases, the 
decay constant trends toward the non-relativistic heavy quark (NRQCD) result. While inspiring, 
calculations at the b quark mass will require the next generation of computers. HPQCD has 
also used HISQ valence quarks on the super fine and ultra fine lattices to compute the c quark 
mass. 

3.2. domain wall fermions 

The LHPC/RBC/UKQCD DWF gauge configurations listed in Table |2j along with a previously 
generated coarser ensemble with a = 0.114 fm, are being used to study the light-strange K 
mesons (kaons) |19 | \2'i \ [2^ and to investigate nucleon structure from QCPpS]. The former is 
an important compliment to the heavy-light meson investigation in the search for CP violation 
and new physics while the latter addresses an important aspect of QCD and is discussed in the 
talk on nuclear physics and lattice QCD by Edwards. 

Figure [7] depicts the most accurate calculation to date of the kaon B parameter Bk which 
also provides a constraint on the vertex of the unitary triangle (geometrical representation of 
the CKM matrix) through the experimental measurement of K meson mixing since CP violation 
in the SM arises only from a single complex phase in the CKM matrix. The figure reveals how 
tricky it is to extrapolate to the physical point. Heavier light-quark points exhibit markedly 
linear behavior which belies the true behavior in the chiral limit. The current precision on Bk is 
roughly 6 % from all sources, but a large fraction is due to the lattice spacing error which is about 
4%. With the new fine DWF lattices, this error can be reduced significantly, constraining the 
SM even further. Preliminary results for Bk computed on the fine lattices are shown in Fig. |8j 
The RBC and UKQCD collaborations are currently working to use both calculations to establish 
the continuum limit of Bk- Indicating its importance to the particle physics community, two 
other groups in USQCD are computing Bk in the "mixed-action" approach. Sharpe, et al., use 
the Asqtad lattices with HYP-smeared staggered valence quarks [26j while Laiho, et al., use the 
Asqtad lattices but with DWF valence quarks [27] . 

Another important precision test of the SM comes from unitarity of the CKM matrix, for 
example |KidP + |KtsP + |^6p = 1 rnust hold; any deviation signals new physics. To extract \Vus\ 
from meson decays whose underlying quark decay process is s — > it, a non-perturbative form 
factor /o(0) must be determined. Figure [o] shows /o(q^) computed with 2+1 flavors of DWF [23]. 
The original calculation extrapolated to = as required by the kinematics imposed by the 
lattice size and periodic boundary conditions. New points, computed using twisted boundary 
conditions and allowing simulations (nearly) at = 0, show excellent agreement. As with Bk, 
analysis is underway using the fine lattices to take the continuum limit of /o(0). 

Like the Asqtad program, a main goal of the DWF work is to investigate QCD near the 
chiral limit and confront experiment. On the coarse DWF ensemble, an important finding was 
that SU(3) chiral perturbation theory for 2+1 flavors of quarks does not work well when the s 
quark mass is near its physical value of roughly 100 MeV and only terms in the next-to-leading- 
order in the expansion are kept. It was much more accurate to use SU(2) chiral perturbation 
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Figure 7. The kaon B parameter computed 
on the coarse DWF ensembles [3]. The 
(extrapolated) value at the physical point is 
shown at the lower left of the figure. 
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Figure 8. The kaon B parameter 
computed on the fine DWF ensembles 
(RBC and UKQCD collaborations). 



theory to extrapolate the light quarks to their physical point |19j . In fact, this was done for the 
calculation of Bk discussed above. Confirming this behavior on the fine lattices is underway. 
It also suggests running simulations at even smaller quark masses. We have started such a run 
where the pion mass ~ 240 MeV (last row in Table [2| . However even with the prodigious power 
of Intrepid, the simulation is out of reach and must await the next generation of Leadership 
Class machines. In the meantime, one can back off of a, lower the quark mass, and still have a 
suitably large volume. The difficulty is that for DWF, increasing a also increases the unwanted 
chiral symmetry breaking effects from finite 5th dimension. 

To tame the chiral symmetry breaking effects, we add an irrelevant term to the action, 
or equivalently, an auxiliary determinant (AuxDet for short) to the weight in the path 
integral |28| \29\ [30] . The AuxDet allows for larger a without the extra chiral symmetry breaking 
effects. DWF simulations with m-,^ = 180 and 240 MeV are being run now at the ALCF. 
Besides strengthening the investigation of the chiral limit of QCD begun on the conventional 
DWF lattices, these large volume, small mass simulations open up an interesting new challenge: 
simulating decays of K mesons to two-pion final states. Important outcomes of such calculations 
are the direct CP violation parameter e' and the long standing AI = 1/2 rule for K decays. 



Figure 10 shows preliminary results computed on the 16^, coarse DWF ensemble for the isospin 
0, two TT meson scattering correlation function. Such calculations are extremely demanding since 
they involve so-called "disconnected" diagrams where the quark propagators are not connected 
with each other. The disconnected diagrams require huge statistics to resolve. Hence, the 
K ^ nn decay project is to run on Intrepid in 2009 as the AuxDet lattices become available. 

4. Outlook 

Enabled by large machines like Intrepid, the gains achieved in lattice QCD simulations were not 
even dreamt of a few years ago. As the continuum and chiral limits are approached, lattice QCD 
calculations are living up to the promise of solving the incredibly rich, complex, and beautiful 
theory of the strong interactions, a fundamental theory that is a cornerstone of particle and 
nuclear physics. Still there is work to be done to reach these difficult limits, and we can't help 
but look to the next generation of computers. In the next five years, with sufficient but realistic 
resources (O(lOO)'s of Teraflop- years), QCD calculations may be done at the physical point with 
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Figure 10. Correlation function for vrvr 
Figure 9. Form factor foiq^) needed to extract scattering. Different curves correspond 
\Vus\ from experimental measurement of the K ^ four distinct quark contractions in the 
meson decay K ttIu, computed on the coarse correlation function (RBC and UKQCD 
ensemble[23]. collaborations). 



several lattice spacings and sizes to take the continuum and infinite volume limits. The Large 
Hadron Collider (LHC) Era is set to begin. Electro-weak symmetry breaking, supersymmetry, 
and beyond the SM phenomenology offer new challenges for lattice gauge theory. 

5. References 



[1 
[2] 
[3] 
[4] 
[5] 
[6] 
[7] 
[8 
[9 

[lo: 
[11 

[12 
[13 
[14 

[15 
[16 
[17 

[is: 

[19 

[20' 
[21 
[22 
[23 
[24 
[25' 
[26 
[27 
[28 
[29 
[30' 



Wilson K G 1974 Phys. Rev. DIO 2445-2459 

Creutz M Cambridge, Uk: Univ. Pr. ( 1983) 169 P. ( Cambridge Monographs On Mathematical Physics) 
Toussaint D 1989 Comput. Phys. Commun. 56 69-92 

Gottlieb S A, Liu W, Toussaint D, Renken R L and Sugar R L 1987 Phys. Rev. D35 2531-2542 
Duane S, Kennedy A D, Pendleton B J and Roweth D 1987 Phys. Lett. B195 216-222 
Clark M A and Kennedy A D 2007 Phys. Rev. Lett. 98 051601 {Preprint he p-lat/0608015J 
Peskin M E and Schroeder D V Reading, USA: Addison- Wesley (1995) 842 p 

Lepage G P 1999 Phys. Rev. D59 074502 {Preprint hep-lat/9809157 ) 

Orginos K and Toussaint D (MILC) 1999 Phys. Rev. D59 014501 {Preprint [hep-lat/9805009ll 



Orginos K, Toussaint D and Sugar R L (MILC) 1999 Phys. Rev. D60 054503 (Preph7i^ |hep-lat/9903032 \ 
Nielsen H B and Ninomiya M 1981 Phys. Lett. B105 219 

Creutz M 2007 Phys. Lett. B649 230-234 {Preprint hep-lat/0701018) 



Bernard C, Golterman M, Shamir Y and Sharpe S R 2008 Phys. Rev. D77 1145 04 (Preprin t |0711 . 0696 1 

Bernard C, Golterman M and Shamir Y 2008 Phys. Rev. D77 074505 ( Preprint [0712.2560p 

Creutz M 2008 Phys. Rev. D78 078501 {Preprint 0805.1350) 

Kaplan D B 1992 Phys. Lett. B288 342-347 {Preprint hep-lat/9206013i 

Shamir Y 1993 Nud. Phys. B406 90-106 {Preprint hep-lat/9303005 ) 



Yamazaki T et al. (RBC-I- UKQCD) 2008 Phys. Rev. Lett. 100 171602 {Preprint 801.4016 1 

AUton C et al. (RBC-UKQCD) 2008 Phys. Rev. D78 114509 {Preprint 0804. 0473^) 

Hasenbusch M 2001 Phys. Lett. B519 177-182 {Preprint hep-lat/0107019) 

Bernard C et al. 2009 Phys. Re v. D79 014506 (Prepnwt 0808 . 2519) 

Bailey J A et al. 2008 (Preprint |0811 . 3640) ' ' 

Boyle P A et al. 2008 Phys. Rev. Lett. 100 141601 (Preprint "0710 . 5136) 

Antonio D J et al. (RBC) 2008 Phys. Rev. Lett. 100 032001 {Preprint h ep-ph/0702042 ) 

Syritsyn S N et al. 2008 PoS LATTICE2008 169 {Preprint 0903.3063]) 

Bae T et al. 2008 PoS LATTICE2008 275 {Preprint 080 9.1220) 

Aubin C, Laiho J and Van de Wa ter R S 2009 {Preprint 09 05 . 3947| ) 

Vranas P M 1999 {Preprint hep-lat/0001006) 

Fukaya H et al. (JLQCD) 2006 Phys. Rev. D74 094505 {Preprint [hep-lat/0607020l) 



Renfrew D, Blum T, Christ N, Mawhinney R and Vranas P 2009 {Preprint 0902.25871 



Acknowledgements. I thank Bob Mawhinney for help with the discussion on tuning the 
RHMC algorithm. I also thank Peter Boyle, Norman Christ, Carleton Detar, Steve Gottlieb, 
Chulwoo Jung, Andreas Kronfcld, Junko Shigemitsu, Enno Scholz, and Bob Sugar for preparing 
various results covered in these proceedings. 



